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We propose a local strategy for constructing scale-free networks of arbitrary degree distributions, 
based on the redirection method of Krapivsky and Redner [Phys. Rev. E 63, 066123 (2001)]. Our 
method includes a set of external parameters that can be tuned at will to match detailed behavior 
at small degree k, in addition to the scale-free power-law tail signature at large k. The choice of 
parameters determines other network characteristics, such as the degree of clustering. The method 
is local in that addition of a new node requires knowledge of only the immediate environs of the 
(randomly selected) node to which it is attached. (Global strategies require information on finite 
fractions of the growing net.) 

PACS numbers: 02.50.Cw, 05.40.-a, 05.50.+q, 89.75.Hc 



Recently much effort has been devoted to the study 
of large networks that surround us in everyday life, such 
as the Internet and the World Wide Web, the electricity 
power grid, flight connections, social networks of contacts 
or collaborations, networks of predator-prey, of neurons 
in the brain, etc., LJ L| An important realization is 
that a majority of these networks share some character- 
istic properties: A small diameter — a small number of 
links connects between any two nodes on the net; clus- 
tering, or the small world property — the neighbors of a 
node tend to be connected to one another; and a scale- 
free degree distribution — the distribution of the num- 
ber of links emanating from a node (the degree k) has a 
power-law tail of the form 



P(k) 



.-A 



(1) 



The scale-free property gives rise to exotic behavior of 
the networks, such as resilience to random dilution (the 
percolation transition does not take place for A < 3), 
on the one hand, and high vulnerability to removal of 
the most connected nodes, on the other hand, and has 
become a principal focus of attention 0, Q . 

Several growth models that produce scale-free net- 
works have been suggested. However, most growth tech- 
niques rely on global properties of the network. Such is 
the case, for example, for the seminal model of Barabasi 
and Albert (BA) |(J, where new nodes are connected to 
an existing node with a probability proportional to its 
degree. The BA algorithm requires global knowledge 
of the degree of all present nodes. While global algo- 
rithms have contributed immensely to our understanding 
of how scale-free degree distributions might emerge, in 
most common situations it is more likely that networks 
evolve by a set of local rules. (One does not typically 
conduct a survey of the Internet for deciding where to 
connect a new router, neither does one study a whole net- 
work of social contacts for selecting new acquaintances.) 
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FIG. 1: The KR redirection model. A new node n (solid) 
is connected to a randomly selected node x with probability 
1 — r (dashed arrow), or the link is redirected to node y, the 
ancestor of x, with probability r (solid thick arrow). After 
Krapivsky and Redner 0. 



Here we propose a local strategy for constructing scale- 
free networks, based on the redirection technique of 
Krapivsky and Redner (KR) 0. Our method includes 
a set of external parameters that enable fine tuning of 
additional properties (degree distribution at small k, de- 
gree of clustering, etc.), while guaranteeing a scale- free 
tail with a given degree distribution exponent, as in the 
original KR method. 

In the KR redirection model, a new node n is con- 
nected to a pre-existing node x, selected at random (with- 
out regard to its degree), with probability 1 — r. If the 
connection is established, x is said to be the ancestor of 
n (the link may be regarded as directed, from n to x). 
With probability r, the connection is not realized, but 
instead it is redirected to y, the ancestor of x (Fig. [IJ. 

The redirection step is essentially what turns the global 
BA strategy of selecting a node with a probability pro- 
portional to its degree, to a local one. (See y|, for an 
early introduction of this basic principle.) Each node 
has but one ancestor, so a node of degree k has k — 1 
incoming links. It follows that an ancestor of degree k 
is reached with a probability proportional to k — 1, but 
purely through a local process. The KR technique builds 
scale- free graphs with degree exponent A = 1 + 1/r (that 
is, 2 < A < oo). The BA net corresponds to the special 
case of r = 1/2 (A = 3). However, the obtained graphs 
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are always trees — they possess no cycles (a unique path 
connects between any two nodes) — and not much else 
can be controlled during the growth process, beyond the 
degree distribution exponent. 

We generalize the redirection algorithm as follows. At 
each time step the new node n is connected to m different 
pre-existing nodes, with probability p m , m — 1,2,..., 
(%2 m Pm = !)■ Each of the m nodes is selected randomly 
and independently by the same redirection trick as for a 
single node: direct connection to the selected node with 
probability 1 — r, or redirection to a random ancestor, 
with probability r. The KR model corresponds to the 
choice p m — 8 m> \. The presence of an initial network 
seed, of No nodes, is implied. The seed allows for the 
process to get started. 

A new node is added at each time step, so the total 
number of nodes increases as 



N(t) = N +t 



(2) 



In the long time asymptotic limit of t ^> Nq, N(t) ~ t. 
Because each new node has m ancestors with probability 
p m , the average number of ancestors of a node at time t 
is 



(m)(t) = 



Ao(m) +tJ2mPn 
N +t 



(3) 



where (m)o is the initial average number of ancestors 
present in the seed. In the long time asymptotic limit, 
(m) ~ m Pm = ci- The average degree of a node (in- 
cluding both incoming and outgoing links) is 



(k) = 2(m) ~ 2a. 



(4) 



Consider now Njp (t), the number of nodes of degree k 
that possess exactly I ancestors. It evolves according to 



dt 



Nl l) = 0, 



k < I. 



(5) 



The first term inside the curly brackets describes gains 
and losses due to direct connections (occurring with prob- 
ability 1 — r) , while the additional terms refer to changes 
due to redirection: since the nodes in question have I 
ancestors, the probability to reach a node of degree k 
by redirection is now proportional to k — I, instead of 
k — 1, as in the original KR model. Note that the num- 
ber of ancestors of a node is fixed at birth, and does not 
change subsequently. The rate for directed connections 
is normalized by Mq = ^ z ^2 k N k = N ~ t, while for 
redirected events, 

M = E E( fc - = N (( k ) - E ^rO- ( 6 ) 



In the long time asymptotic limit N^/N — > pi, hence 
M~ N((k) -a) ~a* [|. 

Due to the time dependence of the rates in Eq. JSJ, 
analytic integration is not feasible. Instead, we argue 
that the transient time does not affect the ultimate dis- 
tribution appreciably, and replace Mq and M by their 
long time asymptotic limits. From here we proceed as 
in |7j. The asymptotic equations admit the general solu- 



tion N ( k L) (t) 



n k t, where 



H' = a ( 1 - r )K-i 



-r[{k - I 



,(0 



0, 



k<l. (7) 



Note that n k is in fact normalized (since N ~ t) and 
represents the probability that a randomly selected node 
be of degree k and have I ancestors. 



The solution of the recursions 0) is 



,(') 



r)/(rk' + 1 + a(l — r) — rl)]. Writing the product as the 
exponential of a sum, expanding to first order in 1 jk' (for 
large k'), and approximating the sum by an integral, one 
finally obtains 



,(0 



fe -(l+l/r) < 



(8) 



The same is true for the overall degree distribution (with- 
out reference to the number of ancestors), 



P{k) = n k 



E 

1=1 



,(0 



(9) 



This follows from Eqs. J7J, summing over I, noting that 

J2i ~ a ' ano - proceeding as for n k \ 

We now have at our disposal a simple local algorithm 
that produces scale-free graphs of any desired degree ex- 
ponent A = 1 + l/V, as well as a set of parameters (the 
{p m }) that can be tweaked at will to achieve a wide 
gamut of additional attributes, without affecting A. Ob- 
vious examples include the average number of outgoing 
links from a node, (to) = ^2 mp m = a, and the average 
node degree, (k) — 2a. (Note that in the original KR 
model a = 1 is the only possible outcome.) 

The degree distribution at small k is determined by 
the particular choice of the {p m }- This is dramatically 
illustrated in Fig. [21 where we show the degree distribu- 
tion of a network grown with r — 1/2 and p m = 1/100 
for m = 1, 2, ... , 100 (p m = for m > 100). In addition 
to the expected k~ 3 tail, we see a very distinct distribu- 
tion, for k < 100. The resulting distribution is perfectly 
reproduced, analytically, using Eqs. 0. 

More importantly, the inverse problem is easily solved 
as well. Given a specific distribution P(k), with a scale- 
free tail ~ k~ x , it can be produced by the redirection 
method in the following way. Set r = 1/(A — 1) for the 
redirection probability, thus ensuring the desired scale- 
free tail. Compute a = (1/2) kP(k) that is implied by 
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FIG. 2: Degree distribution of network constructed with 
p m = 1/100, m = 1,2, ...,100, and r = 0.5 (o). The ana- 
lytical prediction from Eqs. J7J and © (solid line) is shown 
for comparison. 




1 ' 1 ' 1 ' 1 ' 1 ' 1 

U 'T) 0.2 0.4 0.6 0.8 1 
P 

FIG. 4: Clustering index, C, for nets constructed in the redi- 
rection method, with r = 0.1, 0.2, . . . , 0.9 (bottom to top) and 
Pm = (1 - p)Sm.i + p5 m ,2- Plotted is C/Cer, where Cer is 
the clustering index of the equivalent ER graphs 0, ITo| . 
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FIG. 3: Convergence to a target distribution (o) for nets 
grown with the analytically computed p m 's (inset). Shown 
are distributions obtained for networks grown with just the 
first 4, 8, and 12 p m 's (solid curves). Convergence to the 
target distribution improves with increasing range of m. 



the desired distribution. According to Eq. J7J), 



Pm = [1 + o(l - r)\ 



,( m ) 



(10) 



The required rVm are computed iteratively, from Eqs. Q 

P(l). Using this, and 
(i) 



and ©. From © we have 

Eqs. 0, for k = 2, I = 1, one can compute then 



obtain n 



(2) 



P(2) 



from ijjjj, etc. 



We have followed this procedure to compute the p m 
needed to grow a network with the target distribution 
P[k) — c,2c,c,2c, for k = 1,2,3,4, respectively, and 
P{k) = 2c(fc/4)" 2 - 5 , for k > 4, (c = 1/10.4359, is de- 
termined from normalization: Y)P(k) = 1), shown in 
Fig- El As might be expected, the p m decay rapidly with 



m (in this particular 



-2.5 



) and a reasonably 



well-fitting distribution is obtained with just m < 4. In- 
creasing the range of to yields a better fit to the target. 
Alternatively, a better fit can be achieved, for a fixed 
range of to, by treating a as a variable whose value is 
then optimized. 

Another interesting property is the number and dis- 



tribution of loops, or cycles, in the growing nets. One 
measure of cycles is provided by the clustering index, de- 
fined for site i as [Tl| 



d = 



Ei 



2 ki (ki 



1) 



(11) 



It denotes the ratio between Ei, the actual number of 
links between the ki neighbors of the node, to the max- 
imum number of links that would result had all the ki 
nodes been connected to one another. For trees, the 
clustering index is zero at all nodes. In contrast, the 
proposed generalized KR process can produce loops, and 
hence nonzero clustering, whenever a new node is con- 
nected to more than one site: pi < 1 (or p m > for 
some to > 1). 

To test this issue we focused on the KR model, but 
where a newly added node is further connected to a sec- 
ond node with probability p: p\ = 1 — p, pi = p (and 
p rn = for to > 2). Several networks were grown in this 
way, with different values of r and p. The numerical data 
strongly support the predicted relation A = 1 + 1/r and 
the independence from the p rn . 

The results for clustering are summarized in Figs. 01 
and [SI In Fig. we plot the ratio of the global cluster 
coefficient (averaged over all the sites of the network), 
C, to the cluster coefficient of an equivalent Erdos-Reniy 
(ER) graph 0,^3, Cer, as a function of p, for various 
values of r. As might be expected, the clustering in- 
dex increases with p, though the effect saturates rather 
quickly. A more pronounced effect is achieved by increas- 
ing r, which changes C by orders of magnitude (note the 
logarithmic scale) . This has to do with the influence of r 
on the degree distribution. To see that, in Fig.EJwe plot, 
for the same data, the ratio of C to C ran d — the clus- 
tering index in equivalent networks, with identical scale- 
free degree distributions, but where all the connections 
have been redistributed randomly. The dependence on 
r seems a lot weaker by this comparison. Fig. El demon- 
strates, however, significant differences between random 
scale-free networks and nets constructed by the general- 
ized redirection method. 
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FIG. 5: Clustering index for the same data as in Fig. E] but 
compared to equivalent scale-free random graphs (see text). 
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FIG. 6: Degree distribution of "self-consistent" net grown 
with pm = P(ra,i). The distribution peaks at a value of 
k that increases with time, while its tail might be possibly 
interpreted as scale- free (inset): A = l + l/r = 3is consistent 
with our noisy data. 



the p m to vary with time. Consider, for example, the 
"self-consistent" networks grown with p m (t) = P(m,t), 
that is, where the p m reflect the existing degree distri- 
bution at time t. The degree distribution that results 
from this strategy, using r = 1/2, is shown in Fig. [S] We 
have not attempted analyzing this distribution, because 
the equations involved are quite more complicated, and 
we cannot think of any serious applications to this cu- 
rious model. Still, it remains an amusing problem that 
might test the limits of the general approach of evolution 
equations. 

We have presented a local strategy — the generalized 
redirection method — for the growth of scale-free net- 
works. Local algorithms require information about only 
the immediate environs of a visited node, and are more 
likely to reflect real growth processes than global algo- 
rithms, where addition of new nodes requires informa- 
tion about the whole network. The generalized redirec- 
tion technique includes a class of external parameters, 
the p m , that may be tuned at will, without affecting the 
degree exponent (which is controlled by the independent 
parameter r), to achieve a variety of effects. We have ex- 
plicitly demonstrated how to design nets with arbitrary 
degree distributions, at small degree k, and power-law 
(scale-free) tails, at large k. We have also shown that 
additional attributes, such as the degree of clustering, 
can be manipulated by a proper choice of the model's 
parameters. However, how to do this effectively remains 
an open question. We anticipate that analogous algo- 
rithms to the one presented for the degree distribution 
can be designed for other attributes as well, by studying 
the relevant rate equations. 



As a final remark, we note that we have tacitly as- 
sumed that the set {p m } is finite. Indeed, were the set 
infinite, there would be no way to even start the growth 
process, unless one had an infinitely large seed as well, 
since connecting a node to m random nodes requires that 
there be at least m nodes present to begin with. One way 
to get around the limitation to finite sets, is by allowing 
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